feat: support default checksums #1191

0marperez · 2024-11-27T10:53:55Z

Issue #

Description of changes

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

Signed-off-by: 0marperez <[email protected]>

…flexible-checksums

0marperez · 2024-11-27T16:01:36Z

Note to reviewers: I'll address the breaking API changes and failing protocol issues soon. In the meantime, I’d appreciate a review of the checksum-related changes.

lauzadis · 2024-11-27T19:19:09Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+ * Calculates a request's checksum.
 *
- * If the checksum will be sent as a header, calculate the checksum.
+ * If a user supplies a checksum via an HTTP header no calculation will be done. The exception is MD5, if a user


correctness/clarification: "Calculates a request's checksum" and "If a user supplies a checksum via an HTTP header no calculation will be done." conflict with each other

lauzadis · 2024-11-27T19:21:12Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+ *
+ * @param requestChecksumRequired Model sourced flag indicating if checksum calculation is mandatory.
+ * @param requestChecksumCalculation Configuration option that determines when checksum calculation should be done.
+ * @param userSelectedChecksumAlgorithm The checksum algorithm that the user selected for the request, may be null.


naming: requestChecksumAlgorithm

lauzadis · 2024-11-27T19:21:55Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+    private val forcedToCalculateChecksum = requestChecksumRequired || requestChecksumCalculation == HttpChecksumConfigOption.WHEN_SUPPORTED
+    private val checksumHeader = StringBuilder("x-amz-checksum-")
+    private val defaultChecksumAlgorithm = lazy { Crc32() }
+    private val defaultChecksumAlgorithmHeaderPostfix = "crc32"


Postfix -> Suffix

lauzadis · 2024-11-27T19:24:30Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+     * The header must start with "x-amz-checksum-" followed by the checksum algorithm's name.
+     * MD5 is not considered a valid checksum algorithm.
+     */
+    private fun userProviderChecksumHeader(request: HttpRequest, logger: Logger): String? {


style: can be restructured as an extension function HttpRequest.checksumHeader(logger: Logger): String?

and naming: userProvidedChecksumHeader

lauzadis · 2024-11-27T19:30:00Z

...mmon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptor.kt

 * Users can check which checksum was validated by referencing the `ResponseChecksumValidated` execution context variable.
 *
- * @param shouldValidateResponseChecksumInitializer A function which uses the input [I] to return whether response checksum validation should occur
+ * @param responseValidationRequired Flag indicating if the checksum validation is mandatory.


docs: "Model sourced flag"

lauzadis · 2024-11-27T19:40:29Z

...mmon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptor.kt

+            .removePrefix("x-amz-checksum-")
+            .toHashFunction() ?: throw ClientException("Could not parse checksum algorithm from header $checksumHeader")
+
+        if (context.protocolResponse.body is HttpBody.Bytes) {


Why was this branch added? The spec says we should delay checksum calculation until the body is consumed:

Where possible, SDKs MUST defer this calculation and validation until the payload is actually consumed by the user i.e. a payload must not be read twice.

toHashingBody doesn't support Bytes bodies. I decided to calculate the checksum in memory instead of adding support for Bytes to toHashingBody. It should be safe if the request is not being streamed but I think you're right that we're not following the spec

toHashingBody doesn't support Bytes bodies

Does it need to? The previous implementation never seemed to need it

I think it does, the spec has some unit tests with non-streaming bodies

Discussed offline and we'll be removing support for HttpBody.Bytes from FlexibleChecksumsResponseInterceptor because HTTP bodies are never bytes. Source

I think removing support for HttpBody.Bytes is a bad idea. Our current implementations of OkHttp and CRT engines do not return bytes, that's true, but future implementations or other engines wrapped by users may. In addition, interceptors may be used to rewrite response bodies by streaming them into memory, processing them, and then rewriting the body in a new form (possibly HttpBody.Bytes). I think we need to ensure each type of HttpBody is handled correctly.

lauzadis · 2024-11-27T19:41:28Z

.../test/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptorTest.kt

@@ -126,23 +133,6 @@ class FlexibleChecksumsRequestInterceptorTest {
        assertEquals(0, call.request.headers.getNumChecksumHeaders())
    }

-    @Test
-    fun itSetsChecksumHeaderViaExecutionContext() = runTest {


Why was this test removed?

We're no longer setting the checksum header via execution context

lauzadis · 2024-11-27T19:41:52Z

...test/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptorTest.kt

@@ -163,29 +168,4 @@ class FlexibleChecksumsResponseInterceptorTest {

        op.roundTrip(client, TestInput("input"))
    }
-
-    @Test
-    fun testSkipsValidationWhenDisabled() = runTest {


Why was this test removed?

This should be covered under ChecksumConfigTest > ResponseChecksumValidation but I added it back

lauzadis · 2024-11-27T19:43:48Z

...smithy-client/common/src/aws/smithy/kotlin/runtime/client/config/HttpChecksumClientConfig.kt

+
+public enum class HttpChecksumConfigOption {
+    /**
+     * SDK will create/validate checksum if the service marks it as required or if this is set.


docs: create -> calculate

lauzadis · 2024-11-27T19:54:55Z

.../test/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptorTest.kt

correctness: missing tests for when requestChecksumRequired = false and requestChecksumCalculation = HttpChecksumConfigOption.WHEN_REQUIRED. Same for response validation

The request test cases are already here. I think the response validation unit tests need one more to be exhaustive.

Those tests in ChecksumConfigTest.kt aren't validating interceptor behavior like the tests in this file do.

ianbotsf · 2024-11-27T19:02:29Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

- * @param checksumAlgorithmNameInitializer an optional function which parses the input [I] to return the checksum algorithm name.
- * if not set, then the [HttpOperationContext.ChecksumAlgorithm] execution context attribute will be used.
+ * If the request will be streamed:
+ * - The checksum calculation is done asynchronously using a hashing & completing body.


Nit: "asynchronously" → "during transmission"

ianbotsf · 2024-11-27T19:03:02Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+ * - The checksum will be sent in a trailing header, once the request is consumed.
+ *
+ * If the request will not be streamed:
+ * - The checksum calculation is done synchronously


Nit: "synchronously" → "before transmission"

ianbotsf · 2024-11-27T19:31:01Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+    private val defaultChecksumAlgorithm = lazy { Crc32() }
+    private val defaultChecksumAlgorithmHeaderPostfix = "crc32"
+
+    private val checksumAlgorithm = userSelectedChecksumAlgorithm?.let {
+        val hashFunction = userSelectedChecksumAlgorithm.toHashFunction()
+        if (hashFunction == null || !hashFunction.isSupported) {
+            throw ClientException("Checksum algorithm '$userSelectedChecksumAlgorithm' is not supported for flexible checksums")
+        }
+        checksumHeader.append(userSelectedChecksumAlgorithm.lowercase())
+        hashFunction
+    } ?: if (forcedToCalculateChecksum) {
+        checksumHeader.append(defaultChecksumAlgorithmHeaderPostfix)
+        defaultChecksumAlgorithm.value
+    } else {
+        null
+    }


Nit: Seems unnecessary to declare forcedToCalculateChecksum, defaultChecksumAlgorithm, and defaultChecksumAlgorithmHeaderPostfix as fields only to use them in one if branch. Could this just be:

?: if (requestChecksumRequired || requestChecksumCalculation == HttpChecksumConfigOption.WHEN_SUPPORTED) { checksumHeader.append("crc32") Crc32() } else

ianbotsf · 2024-11-27T19:35:19Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+        userProviderChecksumHeader(context.protocolRequest, logger)?.let {
+            logger.debug { "User supplied a checksum via header, skipping checksum calculation" }


Style: These log messages which talk about the user feel awkward. Log messages are for users so it seems strange to mention the user in the third person. I'd suggest rewording these to be more about data/actions and less about the actors involved (e.g., "Found a provided checksum in the request, skipping checksum calculation").

ianbotsf · 2024-11-27T19:47:52Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

-
-                deferredChecksum.complete(checksum)
-            } else {
-                logger.debug { "Calculating checksum asynchronously" }


Nit: We've lost the log message that we're calculating a checksum ~~asynchronously~~ during transmission. I'd suggest moving your "Calculating checksum using '$checksumAlgorithm'" message into the if/else clauses, tweaking each to mention whether the checksum is being calculated before or during transmission.

ianbotsf · 2024-11-27T20:14:57Z

...mmon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptor.kt

+private fun String.isCompositeChecksum(): Boolean {
+    // Ends with "-#" where "#" is a number between 1-1000
+    val regex = Regex("-([1-9][0-9]{0,2}|1000)$")
+    return regex.containsMatchIn(this)
+}


Nit: When this logic is made S3 specific, we shouldn't assert on the part number fitting inside a certain range. Just "-(\d)+$" should suffice.

ianbotsf · 2024-11-27T20:16:29Z

.../test/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptorTest.kt

Nit: Please add tests for HttpChecksumConfigOption.WHEN_REQUIRED. Applies to response interceptors as well.

The request test cases are already here. I think the response validation unit tests need one more to be exhaustive.

ianbotsf · 2024-11-27T21:08:36Z

.../test/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptorTest.kt

-    @Test
-    fun itSetsChecksumHeaderViaExecutionContext() = runTest {
-        checksums.forEach { (checksumAlgorithmName, expectedChecksumValue) ->
-            val req = HttpRequestBuilder().apply {
-                body = HttpBody.fromBytes("<Foo>bar</Foo>".encodeToByteArray())
-            }
-
-            val op = newTestOperation<Unit, Unit>(req, Unit)
-            op.context[HttpOperationContext.ChecksumAlgorithm] = checksumAlgorithmName
-            op.interceptors.add(FlexibleChecksumsRequestInterceptor<Unit>())
-
-            op.roundTrip(client, Unit)
-            val call = op.context.attributes[HttpOperationContext.HttpCallList].first()
-            assertEquals(expectedChecksumValue, call.request.headers["x-amz-checksum-$checksumAlgorithmName"])
-        }
-    }


Question: Do we still use HttpOperationContext.ChecksumAlgorithm or can it be deprecated?

I think it's still used for MD5 checksums in the httpChecksumRequiredTrait

Update on this, it's no longer used anywhere so we can deprecate/remove it

ianbotsf · 2024-11-27T21:10:10Z

...test/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptorTest.kt

-    @Test
-    fun testSkipsValidationWhenDisabled() = runTest {
-        val req = HttpRequestBuilder()
-        val op = newTestOperation<TestInput>(req)
-
-        op.interceptors.add(
-            FlexibleChecksumsResponseInterceptor<TestInput> {
-                false
-            },
-        )
-
-        val responseChecksumHeaderName = "x-amz-checksum-crc32"
-
-        val responseHeaders = Headers {
-            append(responseChecksumHeaderName, "incorrect-checksum-would-throw-if-validated")
-        }
-
-        val client = getMockClient(response, responseHeaders)
-
-        val output = op.roundTrip(client, TestInput("input"))
-        output.body.readAll()
-
-        assertNull(op.context.getOrNull(ChecksumHeaderValidated))
-    }


Question: Why did we delete this test?

This should be covered under ChecksumConfigTest > ResponseChecksumValidation but I added it back

ianbotsf · 2024-11-27T21:13:01Z

...smithy-client/common/src/aws/smithy/kotlin/runtime/client/config/HttpChecksumClientConfig.kt

+    /**
+     * SDK will create/validate checksum if the service marks it as required or if this is set.
+     */
+    WHEN_SUPPORTED,


Nit: "or if this is set" → "or if the service offers optional checksums"

…flexible-checksums

lauzadis · 2024-12-19T18:39:58Z

codegen/protocol-tests/build.gradle.kts

+    // FIXME: Re-enable. This test is broken after a smithy update: https://github.com/smithy-lang/smithy/pull/2467
+    // ProtocolTest("aws-json-10", "aws.protocoltests.json10#JsonRpc10"),


FYI I know we talked about disabling some tests that are failing in Smithy 1.53.0, but these are disabling the entire test suite, which we don't want to do

lauzadis · 2024-12-19T18:43:42Z

...mithy-kotlin-codegen/src/main/kotlin/software/amazon/smithy/kotlin/codegen/model/ShapeExt.kt

@@ -32,7 +33,7 @@ import software.amazon.smithy.rulesengine.traits.EndpointTestsTrait
 * shape's closure for example)
 */
 @Suppress("EXTENSION_SHADOWED_BY_MEMBER")
-inline fun <reified T : Shape> Model.shapes(): List<T> = shapes(T::class.java).toList()
+inline fun <reified T : Shape> Model.shapes(): List<T> = shapes(T::class.java).kotlinToList()


This has never been a problem before, what changed?

There was a refactor in AWS SDK Kotlin to make the codegen tests KMP, which led to the JVM CI checks failing because we've been using JVM 17. I set the JVM version in the tests to 1.8 and the toList function used above is only available since JVM 16

Ok that makes sense. You should be able to use shapes(T::class.java).collect(Collectors.toList()) instead of hacking around with the Kotlin toList

lauzadis · 2024-12-19T18:44:10Z

...software/amazon/smithy/kotlin/codegen/rendering/checksums/HttpChecksumRequiredIntegration.kt

+import software.amazon.smithy.model.traits.HttpChecksumRequiredTrait
+
+/**
+ * Handles the `httpChecksumRequired` trait.


nit/docs: Explain more about how it "handles" the trait

The "meat" of the integration is the middleware and both of them have their own Kdocs. It feels like docs overkill to also add a summary of what each middleware is doing here

lauzadis · 2024-12-19T18:45:37Z

...software/amazon/smithy/kotlin/codegen/rendering/checksums/HttpChecksumRequiredIntegration.kt

+        writer.write(
+            "op.context[#T.DefaultChecksumAlgorithm] = #S",
+            RuntimeTypes.HttpClient.Operation.HttpOperationContext,
+            "MD5",


question/correctness: Can/should we store this as a HashFunction rather than a String?

We could, I think we would have to initialize the HashFunction in the context and its seems slightly worse than what we have right now. Thoughts?

lauzadis · 2024-12-19T18:48:49Z

...client/common/src/aws/smithy/kotlin/runtime/http/interceptors/AbstractChecksumInterceptor.kt

+
+/**
+ * @return The default checksum algorithm name, null if default checksums are disabled.
+ */
+internal fun defaultChecksumAlgorithmName(context: ProtocolRequestInterceptorContext<Any, HttpRequest>): String? =
+    context.executionContext.getOrNull(HttpOperationContext.DefaultChecksumAlgorithm)


correctness: This seems like the wrong place to define this function

Where would be better?

I would have said HttpChecksumRequiredInterceptor but I see it's also used by FlexibleChecksumsRequestInterceptor. I think it's fine here.

style: make it an extension val

internal val ProtocolRequestInterceptorContext<Any, HttpRequest>.defaultChecksumAlgorithmName: String? get() = executionContext.getOrNull(HttpOperationContext.DefaultChecksumAlgorithm)

lauzadis · 2024-12-19T19:12:13Z

runtime/protocol/http/common/src/aws/smithy/kotlin/runtime/http/HttpBody.kt

+/**
+ * Convert an [HttpBody] with an underlying [HashingSource] or [HashingByteReadChannel]
+ * to a [CompletingSource] or [CompletingByteReadChannel], respectively.
+ */
+@InternalApi
+public fun HttpBody.toCompletingBody(deferred: CompletableDeferred<String>): HttpBody = when (this) {


These used to be private, now they are @InternalApi public, meaning they are included in our API dumps and part of backwards compatibility. Was this move necessary?

Oops, I moved this here because I was going to add support for streaming checksums in the HttpCheksumsRequired interceptor

lauzadis · 2024-12-19T19:12:59Z

runtime/runtime-core/common/src/aws/smithy/kotlin/runtime/hashing/HashFunction.kt

+ */
+@InternalApi
+public val HashFunction.isSupportedForFlexibleChecksums: Boolean get() =
+    algorithmsSupportedForFlexibleChecksums.contains(this::class.simpleName)


correctness: Relying on this::class.simpleName seems unsafe.

Better solution:

@InternalApi public val HashFunction.isSupportedForFlexibleChecksums: Boolean get() = when (this) { is Crc32, is Crc32c, is Sha1, is Sha256 -> true else -> false }

Yeah agree, that's what we had before but we would have to keep track of what checksum algorithms are supported for flexible checksums in two places. In HashFunction.isSupportedForFlexibleChecksums & algorithmsSupportedForFlexibleChecksums.

I made a tradeoff here, it seems more risky to possibly miss updating what algorithms we support in both locations than relying on the class name. Plus we have unit tests that should lessen some of our concerns here

The only other use of algorithmsSupportedForFlexibleChecksums is in a log message. I'd rather remove the log message than keep this class reflection

lauzadis · 2024-12-19T19:15:03Z

runtime/runtime-core/common/src/aws/smithy/kotlin/runtime/util/Concurrency.kt

+@InternalApi
+public fun runBlocking(block: suspend () -> Unit) {
+    runBlocking {
+        block()
+    }
+}


question: What is this used for?

It's so we can have runBlocking as a runtime type, it's used in the presigner generator

You can just add a new runBlocking type under the existing KotlinxCoroutines runtime type

lauzadis · 2024-12-19T19:15:27Z

...smithy-client/common/src/aws/smithy/kotlin/runtime/client/config/HttpChecksumClientConfig.kt

@@ -0,0 +1,40 @@
+package aws.smithy.kotlin.runtime.client.config


correctness: missing license at the top. same applies to Concurrency.kt

Thanks, I think we need to adjust our Ktlint for be more stringent when it comes to license headers

lauzadis · 2024-12-19T19:16:11Z

...smithy-client/common/src/aws/smithy/kotlin/runtime/client/config/HttpChecksumClientConfig.kt

+    }
+}
+
+public enum class HttpChecksumConfigOption {


docs: missing KDocs

Maybe we can add a new rule to Ktlint about Kdocs? Not sure it that's possible but I think I'd be helpful

lauzadis · 2024-12-19T19:30:47Z

...smithy-client/common/src/aws/smithy/kotlin/runtime/client/config/HttpChecksumClientConfig.kt

+    }
+}
+
+public enum class HttpChecksumConfigOption {


correctness: Right now request and response checksum configs both use this enum. In the future, their config options might diverge, which will force us to make an API break. I'd recommend splitting this into HttpChecksumRequestConfigOption and HttpChecksumResponseConfigOption now to get ahead of that.

It seems like just speculation for now, I say we only separate them if that time comes. Changing the type of this config option would be a breaking change for all SDKs. I think it's more likely that we would make additive changes

It's not speculative, there was some discussion during the spec review about adding more options to either request or response but not both.

The spec calls out RequestChecksumCalculation and ResponseChecksumValidation as separate config types, so adding a new config option to either of those wouldn't be a breaking change for SDKs which implement it as the spec says.

ianbotsf · 2025-01-09T17:33:20Z

...client/common/src/aws/smithy/kotlin/runtime/http/interceptors/AbstractChecksumInterceptor.kt

+/**
+ * Handles checksum calculation so that checksums will be cached during retry loop
+ */
 @InternalApi
 public abstract class AbstractChecksumInterceptor : HttpInterceptor {
    private var cachedChecksum: String? = null

    override suspend fun modifyBeforeSigning(context: ProtocolRequestInterceptorContext<Any, HttpRequest>): HttpRequest {
-        cachedChecksum ?: calculateChecksum(context).also { cachedChecksum = it }
-        return cachedChecksum?.let { applyChecksum(context, it) } ?: context.protocolRequest
+        cachedChecksum = cachedChecksum ?: calculateChecksum(context)
+
+        return if (cachedChecksum != null) {
+            applyChecksum(context, cachedChecksum!!)
+        } else {
+            context.protocolRequest
+        }
    }


Question: "checksums will be cached during retry loop"...is that what we want? One of the things which could occur between a failed first attempt and a subsequent retry is the body changing because, say, it's a file stream and the underlying file has been modified. In that case, we'd send the old checksum and the new file body?

It was originally added for performance reasons. There's a discussion about it here. In your example someone/something would have to modify the underlying file in a file stream between retries. I'm not sure how often that happens for our customers but I'm open to getting rid of it.

ianbotsf · 2025-01-09T17:42:52Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

        }
+
+        logger.debug { "Checksum wasn't provided, selected, or isn't required: skipping checksum calculation" }


Correctness: This message still fires in the case we're doing the chunked streaming hashing but it shouldn't.

ianbotsf · 2025-01-09T17:46:29Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+    /**
+     * Determines what checksum algorithm to use, null if none is required
+     */
+    private fun resolveChecksumAlgorithm(
+        requestChecksumRequired: Boolean,
+        requestChecksumCalculation: RequestHttpChecksumConfig?,
+        requestChecksumAlgorithm: String?,
+        context: ProtocolRequestInterceptorContext<Any, HttpRequest>,
+    ): HashFunction? =
+        requestChecksumAlgorithm
+            ?.toHashFunctionOrThrow()
+            ?.takeIf { it.isSupportedForFlexibleChecksums }
+            ?: context.defaultChecksumAlgorithmName
+                ?.toHashFunctionOrThrow()
+                ?.takeIf {
+                    (requestChecksumRequired || requestChecksumCalculation == RequestHttpChecksumConfig.WHEN_SUPPORTED) &&
+                        it.isSupportedForFlexibleChecksums
+                }


Comment: Nice fluid/functional style! 🤩

ianbotsf · 2025-01-09T17:53:40Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

            else -> {
-                val bodyBytes = req.body.readAll()!!
-                req.body = bodyBytes.toHttpBody()
+                val bodyBytes = req.body.readAll() ?: byteArrayOf()
+                if (req.body.isOneShot) req.body = bodyBytes.toHttpBody()
                bodyBytes.hash(checksumAlgorithm).encodeBase64String()
            }


Question: This is an interesting change. We're now replacing the request body only if it's a oneshot body. Replayable (multi-shot?) bodies don't get replaced and so we'll read them again later when transmitting the payload.

In theory that could be slightly more performant if the backing channel/source was already in memory because now we'd no longer be duplicating it. But I think it could also be slightly less performant if the backing channel/source has to be read from IO (disk, network, etc.) since we'd be doing that IO twice.

What led to this change? Was replayable stream performance a factor or was it something else?

Yeah, it seemed like we could improve performance a bit here but I hadn't considered what you said. I don't have a strong inclination towards keeping the optimization fwiw

ianbotsf · 2025-01-09T19:00:22Z

...mmon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptor.kt

    override suspend fun modifyBeforeDeserialization(context: ProtocolResponseInterceptorContext<Any, HttpRequest, HttpResponse>): HttpResponse {
-        if (!shouldValidateResponseChecksum) {
-            return context.protocolResponse
-        }
+        val logger = coroutineContext.logger<FlexibleChecksumsResponseInterceptor>()

-        val logger = coroutineContext.logger<FlexibleChecksumsResponseInterceptor<I>>()
+        val configuredToVerifyChecksum = responseValidationRequired || responseChecksumValidation == ResponseHttpChecksumConfig.WHEN_SUPPORTED
+        if (!configuredToVerifyChecksum) return context.protocolResponse


Nit: We don't need logger in the case of !configuredToVerifyChecksum and we exit early. We could move the initialization of it closer to the use site.

ianbotsf · 2025-01-09T19:02:06Z

...mmon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptor.kt

+        if (ignoreChecksum(serviceChecksumValue)) {
+            logger.info { "Checksum detected but validation was skipped." }
+            return context.protocolResponse
+        }


Nit: Putting the logging statement here means we can't provide more information about why validation was skipped. I think we should log in the subclass (IgnoreCompositeFlexibleChecksumResponseInterceptor) where we can definitely state the reason.

ianbotsf · 2025-01-09T19:10:57Z

...nt/common/src/aws/smithy/kotlin/runtime/http/interceptors/HttpChecksumRequiredInterceptor.kt

+        return when (val body = context.protocolRequest.body) {
+            is HttpBody.Bytes -> {
+                checksumAlgorithm.update(
+                    body.readAll() ?: byteArrayOf(),
+                )
+                checksumAlgorithm.digest().encodeBase64String()
+            }
+            else -> null // TODO: Support other body types
+        }


Question: Why the TODO here? Can't we just use HashingSource and HashingByteReadChannel to implement the other body types?

Yeah, we can use those types. A small refactor is needed and I think deadlines shifting left me uncertain if I could get this functionality supported on time. I think we might as well do it now since the refactor will be a breaking change

ianbotsf

Looks pretty good. Some minor feedback/suggestions...fix (or don't) and ship!

ianbotsf · 2025-01-13T18:04:28Z

...ommon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsRequestInterceptor.kt

+    /**
+     * Applies a checksum based on the requirements and limitations of [FlexibleChecksumsRequestInterceptor]
+     */
+    private fun applyFlexibleChecksumsChecksum(


Question: The names applyFlexibleChecksumsChecksum and calculateFlexibleChecksumsChecksum are pretty unwieldy. Given that these method are already in a class with "flexible checksums" in the name, it seems like we could just stick with the original names applyChecksum and calculateChecksum. Was there a reason to extract private helper methods with longer names here?

(Same question in HttpChecksumRequiredInterceptor.kt)

It's so we can share logic for caching and non-caching checksum calculation. I gave the helper functions longer names to avoid naming conflicts with the superclass functions called applyChecksum and calculateChecksum.

ianbotsf · 2025-01-13T18:13:10Z

...mmon/src/aws/smithy/kotlin/runtime/http/interceptors/FlexibleChecksumsResponseInterceptor.kt

-    public open fun ignoreChecksum(checksum: String): Boolean = false
+    public open fun ignoreChecksum(checksum: String, logger: Logger): Boolean = false


Suggestion: Instead of passing the logger, we could pass context: ProtocolResponseInterceptorContext<Any, HttpRequest, HttpResponse>. The full context which contains more information and may be useful to future subclasses which need more data to make a determination about ignoring a checksum. The logger would then be available to implementors as context.executionContext.coroutineContext.logger() (or ...coroutineContext.warn { ... }, etc.).

Good suggestion, let me do this before I merge

ianbotsf · 2025-01-13T18:21:10Z

...nt/common/src/aws/smithy/kotlin/runtime/http/interceptors/HttpChecksumRequiredInterceptor.kt

+            req.body.contentLength == null && !req.body.isOneShot -> {
+                val channel = req.body.toSdkByteReadChannel()!!
+                channel.rollingHash(checksumAlgorithm).encodeBase64String()
+            }


Question: Why does it matter if req.body.contentLength == null?

I'm assuming it's a requirement for toSdkByteReadChannel and rollingHash. This code is originally from here

Ah, this was discussed in #772 (comment). That helps add some context.

github-actions · 2025-01-15T02:14:12Z

Affected Artifacts

Significantly increased in size

Artifact	Pull Request (bytes)	Latest Release (bytes)	Delta (bytes)	Delta (percentage)
smithy-client-jvm.jar	66,614	62,681	3,933	6.27%
http-jvm.jar	108,671	103,477	5,194	5.02%

Changed in size

Artifact	Pull Request (bytes)	Latest Release (bytes)	Delta (bytes)	Delta (percentage)
http-client-jvm.jar	317,319	314,811	2,508	0.80%
runtime-core-jvm.jar	815,834	812,469	3,365	0.41%

Other approval available and discussed offline this is ok to merge

0marperez added 10 commits November 10, 2024 22:28

Bump smithy IDL version

2a9d104

Signed-off-by: 0marperez <[email protected]>

Add requestChecksumCalculation config option

205839c

Added responseChecksumValidation

7233b71

Add todos for business metrics

e1dc616

Unit tests pass

e436482

Merge branch 'main' of https://github.com/awslabs/smithy-kotlin into …

abdba02

…flexible-checksums

E2E tests pass

3e4c891

Self review

9760ee1

Self review 2

f8b39b0

Smithy codegen version bump

f676b7b

0marperez added the no-changelog Indicates that a changelog entry isn't required for a pull request. Use sparingly. label Nov 27, 2024

This comment has been minimized.

Sign in to view

lauzadis reviewed Nov 27, 2024

View reviewed changes

ianbotsf reviewed Nov 27, 2024

View reviewed changes

0marperez and others added 4 commits December 3, 2024 11:23

Make composite checksum check S3 specific

6e9b206

Turn off all failing protocol tests

fb3a52a

PR feedback and fix breaking changes

40bb298

Merge branch 'main' into flexible-checksums

828adaa

This comment has been minimized.

Sign in to view

Trigger CI

e2068e7

This comment has been minimized.

Sign in to view

Drop support for http body dot bytes response checksums

08b4a37

This comment has been minimized.

Sign in to view

0marperez marked this pull request as ready for review December 4, 2024 20:53

0marperez requested a review from a team as a code owner December 4, 2024 20:53

Fix HttpChecksumRequiredTrait

91355d1

This comment has been minimized.

Sign in to view

Fix kotlin writer runtime exception

1fcd4b2

Merge branch 'main' of https://github.com/awslabs/smithy-kotlin into …

dc3ce8b

…flexible-checksums

This comment has been minimized.

Sign in to view

Use toList supported for JVM versions less than 16

3ef1c20

This comment has been minimized.

Sign in to view

lauzadis previously requested changes Dec 19, 2024

View reviewed changes

lauzadis reviewed Dec 19, 2024

View reviewed changes

PR feedback

7116221

This comment has been minimized.

Sign in to view

0marperez added acknowledge-artifact-size-increase acknowledge-api-break Acknowledge that a change is API breaking and may be backwards-incompatible. Review carefully! labels Dec 24, 2024

This comment has been minimized.

Sign in to view

Change JVM version

344f118

This comment has been minimized.

Sign in to view

Clean up

c689ff5

This comment has been minimized.

Sign in to view

misc: revert toList/JVM compatibility changes

9beca23

This comment has been minimized.

Sign in to view

ianbotsf requested changes Jan 9, 2025

View reviewed changes

fix: pr feedback v1

aa714d9

This comment has been minimized.

Sign in to view

0marperez added 2 commits January 13, 2025 12:07

fix: pr feedback v2 (get rid of caching non bytes http bodies)

69b7765

misc: merge from main

399e87d

This comment has been minimized.

Sign in to view

ianbotsf approved these changes Jan 13, 2025

View reviewed changes

fix: pr feedback v3

f4c7e17

0marperez merged commit e0c25d6 into main Jan 15, 2025
16 checks passed

0marperez deleted the flexible-checksums branch January 15, 2025 17:35

		userProviderChecksumHeader(context.protocolRequest, logger)?.let {
		logger.debug { "User supplied a checksum via header, skipping checksum calculation" }

		// FIXME: Re-enable. This test is broken after a smithy update: https://github.com/smithy-lang/smithy/pull/2467
		// ProtocolTest("aws-json-10", "aws.protocoltests.json10#JsonRpc10"),

		@@ -0,0 +1,40 @@
		package aws.smithy.kotlin.runtime.client.config

		}

		logger.debug { "Checksum wasn't provided, selected, or isn't required: skipping checksum calculation" }

		public open fun ignoreChecksum(checksum: String): Boolean = false
		public open fun ignoreChecksum(checksum: String, logger: Logger): Boolean = false

feat: support default checksums #1191

feat: support default checksums #1191

Conversation

0marperez commented Nov 27, 2024

Issue #

Description of changes

This comment has been minimized.

This comment has been minimized.

0marperez commented Nov 27, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lauzadis Nov 27, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

0marperez Dec 2, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

0marperez Dec 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

0marperez Dec 19, 2024 • edited Loading

Choose a reason for hiding this comment

lauzadis Dec 19, 2024 • edited Loading

Choose a reason for hiding this comment

This comment has been minimized.

lauzadis Nov 27, 2024 •

edited

Loading

0marperez Dec 2, 2024 •

edited

Loading

0marperez Dec 19, 2024 •

edited

Loading

0marperez Dec 19, 2024 •

edited

Loading

lauzadis Dec 19, 2024 •

edited

Loading